Accession Number |
TCMCG075C27083 |
gbkey |
CDS |
Protein Id |
XP_017983265.1 |
Location |
complement(join(22378569..22378684,22378790..22380829,22380964..22381153,22382315..22382660,22382863..22382987,22383061..22383194,22384298..22384355,22389232..22389384,22389893..22390985,22391158..22391323,22391817..22391953,22392483..22392595,22394391..22394515,22395299..22395445,22395796..22395964,22396061..22396272,22396389..22396584)) |
Gene |
LOC18589558 |
GeneID |
18589558 |
Organism |
Theobroma cacao |
Length |
1839 aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018127776.1 |
Definition |
PREDICTED: nuclear pore complex protein NUP214 isoform X2 [Theobroma cacao] |
CDS: ATGGCGAGCAGAGTTGAAATTGAGGAAGAGAAGGAAGGAGAGCATGTTGATACCACGGATTTCTTCTTCGAGAAAATCGGCGAACCCGTGCCTATCAAATCTGAGGAGGATTCCCTATTCGATCTCCGAAGCCCTCCACCTCAGGCCCTCGCCCTCTCCCAGCGTTTCCAGCTGCTATTTCTCGCTCATTCATCTGGGTTCTTGGTAGCGAGGACCAAGGATGTGATTAATTTGGCCAAAGATATTAAGGAAACTGGTTCTCCTTCTAGTATTGAGGATTTGAGTTTGGTGGATGTTCCCATTGGCAAGCTTCGCATTCTGGCTCTTTCCCCCACCGACGATTCCACACTTGCGGTTTCTGTTGCCGCTGATATTCATTTCTTCAATGTCAACACCCTTCTCAATAAGGAGATAAAGCCATGTTTTTCCATTTCGCTTCCTCAGTCAAGCTTTGTCAAGGATTTTCGATGGAGGAAGAAGAAGGATAACTCCTTCCTTGTTCTTTCAGATGATAGCAAGTTATATCATGGAACTCTTACTCACCCTCTTAAACATGTGATGGATAACGTCGATGCTGTTGAGTGGAGTGTAAAAGGTGCTTTTGTTGCTGTGGCAAAAGATGATAGTCTTAGTATTCTGTCAGCTAAATTCAATGAGAAGTTGTGCATGGTACTGCCATTCAAATCTTGGATTGGCGATTGCAATGGTGATTGCACTGTAAAAGTGGATACTATCAGGTGGGTTCGTCCTGATTGCATTGTTCTAGGTTGCTTTCAGTTTACTGCAGATGGTGAAGAGGAAAATTACCTTGTTCAAGTAGTCAAAAGCAAGACTGGCAAAATCACTGATGCTACTTCTGACTTGGTTGTGCTCTCCTTCAGTGATTTGTTTGCTGGTCTGATTGATGACATTGTGCCTTTCAGAACTGGACCTTATTTATTCTTGAGCTATTTGGAACAATGTGAGCTTGCAATTGCTGCTAACATAAAGAACACAGATCAGCATATTGTGTTACTCAGTTGGTCACTTGGTGAGACTGGTGAAGCTTCAGTTATTGATATTGAGCGTGATAATTGGCTCCCAAGGATTGAGCTTCAAGAGAATGGTGATGACAATTTGATCATGGGGCTTTGCATTGATAAAGTTTCTCTCTTTGGGAATGTGAAAGTCCAACTTGGAGTTGAAGAAGTCAAGGAACTTTCACCATATTGTGTTCTTATATGCCTTACTTTAGAGGGCAAGCTTATTATGTTCCACATTGCAAGTGTCACTAAAAATGCTGTTCCATTTGATGTTGCTGCCCATTCTGATAAAGAAGAGGATACCCCTGCTGTGGTACCTGAAGAATTTGATCTACCTAAACTTACTTATGGGCAGGGTGAGCAAAAGTCAGAACAGGTAACTTCGGTTCTTCCATTACTGGATCAAAGCAAAAAGGAGCTGCTTACTAATGGTAGTGAAATTCCTATTAAAAGTGATGTAAACCTTTCTGAAAGGAATGTGAACTCTGTTATGCATGCAACCAATGAAGCATTCGATAAGGATAATATTCAGAGATCAGTGTCTTTACAAATCTCCCAGTCTTTTGAAGCTGTTGGCCAGCAAAAACCTCCAACCACAAAGCCGCTCCAAGAAGCAGGCAGTCAACATAAATTGCTTTCTGGACAGCAAGGTACAAATTCAGGGCAATCATTTTTGAAGACTTCTCAACTAGAGGGACCTGGTAACAAGTTGAGGGATGGTAGTCAAACAGAAACTCAAAAGATTGCAGGAGTTGGATCTATTGCTTCTTTTGGAGGAAAATTTTCAAATGATACCTTAACACAACCAAACCATGAGAATGTACCAAAAAATTTTGAGCTGGTTAAGGAATCAGTAGGCAAAACTGGATCAATTGGATCGCAGAGTGCATCATTTCAGCCATGGCCAATTCCATCATCTCAGTCACTGATGAGTGGAAAACACATGCTTTCAGAGGAGTCTGATGCCAGATCT
TCATTTTCACCTTCAAGTCATATTCAGTGCAGTAGATCTCTGGGTTCTGGAGTTACAATGGATACTACATGCATTTCTATTAGCAATGTTGGAAAACCTTCACATCTGAAAGATACTGCTGGGACATCAATTTCAGTCGATAAATTTTCAGGGAGACCAGTAGATACACAGAAATATTCAATGGGGGCAGGAAATATTGAGTCAGTACCTCTAATTTGTGGATCACAATTATCATCACAGCTAAATTTTGCACTGGAAAAGTCTCCCAACCAAAAGCTTTATCACCCCAAGGATGACTATAAATCTTCAACCCAGTCAGGGATGCGGACATCTGAACCACACTTATCCAAACAATTCAGCAATATTAGAGAGATGGCTGAAGAATTGGACACACTTTTGGAATCTATTGAAGAAACTGGTGGCTTTAGGGATGCTTGCACTGTTTACCAAAAGAGTTCAGTTGAAGCGCTGGAGAGGGGAATAGCTTTTCTTTCTGACAAATGCAGGAGATGGGAGAACATGATGGATGAGCATCTTGGGAAGATCCAGCATCTTCTTGATAAAACTGTTCAAGTTTTAGCAAGGAAGATATACATGGAAGGCATTGTTAAGCAAGCTTCTGATAGCCAATACTGGGACCTCTGGAATCGCCAGAAGTTGAGTTCTGAGCTTGAGCTTAAGCGGCGACATATACTGAAATTGAATCGGGATTTGACCAATGAGTTAATTGAATTAGAGAGGCATTTCAACACCTTTGAACTCCATAAATTTGGTGATAACAATGGAGTTGATGCAGGTTGGAGAGCTTTGCAGAGTAGATTTGGGTCTTCAAGACACATACAGTCTCTGCATACTTTACATAACACAATGAATTCACAACTAGCAGCTGCTGAGCAACTTTCTGAATGCCTCTCACAACAAATGGCCATGCTGAGTGTAGAGTCACCTGTAAAACAACAAAATGTGAAAAAGGAGTTGTTCCAAACAATTGGTTTAGCATATGATGCCTCTTTTACCTCTCCAGGTGTGACAAAACCTAGCAATACTTCTTCAGTGAAGAAACTTGTTCTTTCCTCTGGGTCCACTGCTTCTAGAATTCAGTCCAGGAGAAACCCATCCAGTGCTCTGAAGAGTTTTGACCCAGAAATTGCAAGGAGGAGGAGGGACTCGCTGGATCAGAGCTGGGCTAGTTTTGAGCCTCCAAAAACTACCGTGAAAAGGATGCTTTTGCAAGAATCAGCAAGTGTAAAAAGAACATCCTTCACAGATAAGCAGAACTTTAGCCCTTACGCTCCTGAGGAATCAACAAGTTCATTGTCAAAGGAACACCCAGCAACTTCAGCCATGTTCTATCAATCTGGAAAAGAAGGCACCCAGGATGCATTCCCAAAGCAGGAATCTGAATCAACCCTATTCAGATGGGCTAATAATTCTCTAGTTGCGCCACAATCTACTGGGTGGAACTCTTGTACAGTGCAAACAAGTAACTTCTCTGCTTTGTCATCAACATCAGGATCACAACCTATGGTGGTGCAGAATCGTTTGGGGGAAACTTGCAGTATTCCTGTTGCTAAATCAAACACTGGAGCTTCTCATCTTGAGAGGTTTAATAGTTCATCTTTCTATGAGAATGAGATTCAATTCACTCAACAGTTTAGACCTGATCTATGTCAAGAGTTATCAATCTCCCAGGTGGCTTCATTGCCAAAGAAATCCACAGACATCCCAAATTCAGATGGTAAAGGGACTGTGCTTGCAAATTCAGCCCTTGGGTATGTGAAACAGGTGCCATCAACCACAAAGAGTACACTTTTTGGTTCTTCTAACAATTATGACCCCCAGTTCATGCCCCCAGCTGCTGTTTCTGCATCTTCTACCCTTTCAGCAAAAGTTTCACAAGTCAATTTTATAAAAAGCAAAAGCCAGCCCAGTGAAAAGGTATCAGAGTCTTCTGCATTCTCAAAACCAGTTTCAGATTCATCATCAACTCTTTCATTATC
ATCATCATTTTCTACGGTGCCAACCTCATCAGTCACATCAATCCCCACATCAGTTTCTATGATGTCATCTGCAACAATGGGTTCATCATCGGCCCCAAACTTTTCCTTTTCAACCTCATTTTCAATCGTTTCATCTTCATCATCTGGAACACAATTCAGCGATTCTATGACAAGTTCTATAGTTTCTGCACATGCTAATAGAAAAGCATCATCTTCATCATCATCACTATCAATTTTTCCCTCTGCTGGTGTTTCCTCTTCAAATTCCCTCTCCATTCATCCTCACCAAATACCTGTGCCCTTCCCTTCTGATTCACCACCCGTAAGTTCACCATCAGAGATTTTGAAAACTGAGGCTCAACCACGTATGGAAACACTTGGCTTGAAAAAGAATGTCGATTCAATGACACAGGCACTACCACTGCAGCATGAACTTCCAGCAGCGGGATTGAGTTTGAAGCCAGAAGCTGCAGTGTCATCTTCACCGATATGTGAAACTCCTACCCGAATATCATCTGGAAGCCAGAGCAGCATCATTAATGTTGCAAGTCCTGCATCTAATTTGGCATCAAATGCTCATCCAGTGCAGCCTGCTACTGGAGATATTCTTTTTACTGCACCATTGTCAACTAGTATTAGCACCACTGATGGAAAAAGTGGAAGTTTGGATGTAACGGTTACACAAGAGGATGAGATGGAGGAGGAGGCTCCTGAGACAAACCAAAGGACTGAACTTAGCTTGGGAAGCTTAAGTGGCTTTGGCAATGGCTCAACTCCTAACCCAACTGCTCCTAAGCCTAATCCATTTGGTGCTCCATTTGGAATTGTGGCTCCACGTATGGCTAGCTCTTCGTTCACCACAGCTCTTCCTAGTGGAGAGTTGTTTAGGCCTGCATCTTTTAGCTTCCAATCTCCTCAGCCTTCCCAATTGGCTCACCCTGCAAATTTTGGCGCATTCTCTGGTGGCTTTGCTTCTAGCACATCTGGTCAAGCTCCTGCTCAAAGGGCTTTTGGCCAACCAGCACAGCTTGGAGTTGGACAGCAGGCTCTGGGATCAGTTCTTGGTTCTTTTGGGCAGTCAAGACAGATTGGTACTGGACTACCAGGAAGTGGTTTTGCTTCTGTGAGTGGTTTTGGAGGTGGTTTTGCAGGTTCTCAATCTGCTGGTGGGTTTTCTAATGCTGCAACAGGAGGTGGATTTGCTGGCGTTGCCTCTAGTAGTGGTGGCTTTGCTGCTTTGGCTTCAGGTGGTGGGGGATTTGGTGGTCTGGCTTCTGGTGGTGTTGGATTTGGTGGTCTGGCTTCTGGTGGTGGTGGATTTGGTCTGGGTTCAGCTGGAGGATTCACAGCTGCGGCCTCAGGTGGTGGTGGTGGAGGAGGATTTGCTGCTGCTGCCGCTGCTGCCTCAGGGGGTGGGTTTGGCGCTTTCAGCAGCCAGCAAGGGAATGGTGGTTTCTCAGCCTTTGGAGGTGGCGCAGGACAAACTGGCAAACCTCCTGAGCTTTTCACACAAATGAGAAGGTAG |
Protein: MASRVEIEEEKEGEHVDTTDFFFEKIGEPVPIKSEEDSLFDLRSPPPQALALSQRFQLLFLAHSSGFLVARTKDVINLAKDIKETGSPSSIEDLSLVDVPIGKLRILALSPTDDSTLAVSVAADIHFFNVNTLLNKEIKPCFSISLPQSSFVKDFRWRKKKDNSFLVLSDDSKLYHGTLTHPLKHVMDNVDAVEWSVKGAFVAVAKDDSLSILSAKFNEKLCMVLPFKSWIGDCNGDCTVKVDTIRWVRPDCIVLGCFQFTADGEEENYLVQVVKSKTGKITDATSDLVVLSFSDLFAGLIDDIVPFRTGPYLFLSYLEQCELAIAANIKNTDQHIVLLSWSLGETGEASVIDIERDNWLPRIELQENGDDNLIMGLCIDKVSLFGNVKVQLGVEEVKELSPYCVLICLTLEGKLIMFHIASVTKNAVPFDVAAHSDKEEDTPAVVPEEFDLPKLTYGQGEQKSEQVTSVLPLLDQSKKELLTNGSEIPIKSDVNLSERNVNSVMHATNEAFDKDNIQRSVSLQISQSFEAVGQQKPPTTKPLQEAGSQHKLLSGQQGTNSGQSFLKTSQLEGPGNKLRDGSQTETQKIAGVGSIASFGGKFSNDTLTQPNHENVPKNFELVKESVGKTGSIGSQSASFQPWPIPSSQSLMSGKHMLSEESDARSSFSPSSHIQCSRSLGSGVTMDTTCISISNVGKPSHLKDTAGTSISVDKFSGRPVDTQKYSMGAGNIESVPLICGSQLSSQLNFALEKSPNQKLYHPKDDYKSSTQSGMRTSEPHLSKQFSNIREMAEELDTLLESIEETGGFRDACTVYQKSSVEALERGIAFLSDKCRRWENMMDEHLGKIQHLLDKTVQVLARKIYMEGIVKQASDSQYWDLWNRQKLSSELELKRRHILKLNRDLTNELIELERHFNTFELHKFGDNNGVDAGWRALQSRFGSSRHIQSLHTLHNTMNSQLAAAEQLSECLSQQMAMLSVESPVKQQNVKKELFQTIGLAYDASFTSPGVTKPSNTSSVKKLVLSSGSTASRIQSRRNPSSALKSFDPEIARRRRDSLDQSWASFEPPKTTVKRMLLQESASVKRTSFTDKQNFSPYAPEESTSSLSKEHPATSAMFYQSGKEGTQDAFPKQESESTLFRWANNSLVAPQSTGWNSCTVQTSNFSALSSTSGSQPMVVQNRLGETCSIPVAKSNTGASHLERFNSSSFYENEIQFTQQFRPDLCQELSISQVASLPKKSTDIPNSDGKGTVLANSALGYVKQVPSTTKSTLFGSSNNYDPQFMPPAAVSASSTLSAKVSQVNFIKSKSQPSEKVSESSAFSKPVSDSSSTLSLSSSFSTVPTSSVTSIPTSVSMMSSATMGSSSAPNFSFSTSFSIVSSSSSGTQFSDSMTSSIVSAHANRKASSSSSSLSIFPSAGVSSSNSLSIHPHQIPVPFPSDSPPVSSPSEILKTEAQPRMETLGLKKNVDSMTQALPLQHELPAAGLSLKPEAAVSSSPICETPTRISSGSQSSIINVASPASNLASNAHPVQPATGDILFTAPLSTSISTTDGKSGSLDVTVTQEDEMEEEAPETNQRTELSLGSLSGFGNGSTPNPTAPKPNPFGAPFGIVAPRMASSSFTTALPSGELFRPASFSFQSPQPSQLAHPANFGAFSGGFASSTSGQAPAQRAFGQPAQLGVGQQALGSVLGSFGQSRQIGTGLPGSGFASVSGFGGGFAGSQSAGGFSNAATGGGFAGVASSSGGFAALASGGGGFGGLASGGVGFGGLASGGGGFGLGSAGGFTAAASGGGGGGGFAAAAAAASGGGFGAFSSQQGNGGFSAFGGGAGQTGKPPELFTQMRR |
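
The Location, Length, and CDS fields can be cross-checked against each other. The following is a minimal sketch, not part of the record or of any database tooling. It assumes that the coordinates are 1-based and inclusive (the GenBank convention) and that the CDS includes the stop codon. The function names `parse_location`, `spliced_length`, and `matches_protein_length` are hypothetical.

```python
import re


def parse_location(location: str):
    """Return (strand, segments) for a GenBank-style location string.

    strand is "-" when the location is wrapped in complement(...), else "+".
    segments is a list of (start, end) tuples, 1-based and inclusive.
    """
    strand = "-" if location.strip().startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def spliced_length(segments) -> int:
    """Total nucleotides covered by the joined segments (inclusive ends)."""
    return sum(end - start + 1 for start, end in segments)


def matches_protein_length(location: str, protein_aa: int, has_stop: bool = True) -> bool:
    """Check that the spliced CDS length equals 3 nt per residue.

    Assumption: when has_stop is True, one extra codon is counted for the stop.
    """
    _, segments = parse_location(location)
    expected_nt = 3 * (protein_aa + (1 if has_stop else 0))
    return spliced_length(segments) == expected_nt


# Toy example (not from the record): two 6-nt exons on the minus strand,
# i.e. 12 nt = 3 residues + 1 stop codon.
toy = "complement(join(10..15,20..25))"
strand, segs = parse_location(toy)
print(strand, segs, spliced_length(segs), matches_protein_length(toy, 3))
```

Applied to the Location field of this record, the 17 segments sum to 5520 nt, which equals 3 × (1839 + 1) and is therefore consistent with the stated length of 1839 aa plus a terminal stop codon.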